Catastrophic Fault Recovery with Self-Reconfigurable Chips
نویسندگان
چکیده
Mission critical systems typically employ multistring redundancy to cope with possible hardware failure. Such systems are only as fault tolerant as there are many redundant strings. Once a particular critical component exhausts its redundant spares, the multi-string architecture cannot tolerate any further hardware failure. This paper aims at addressing such catastrophic faults through the use of “Self-Reconfigurable Chips” as a last resort effort to “repair” a faulty critical component.
منابع مشابه
Error Recovery in Critical Infrastructure Systems
Critical infrastructure applications provide services upon which society depends heavily; such applications require survivability in the face of faults that might cause a loss of service. These applications are themselves dependent on distributed information systems for all aspects of their operation and so survivability of the information systems is an important issue. Fault tolerance is a key...
متن کاملSelf-Repairing Algorithm with Shared Spare Allocation for Reconfigurable Systems
Self–repairing digital systems have received increased attention as the modern systems are becoming more complex and fast. For systems operating in harsh and/or hostile environments even a single failure event can result in huge loss and disastrous effects. Availability of a system can be increased by making it capable of detecting and recovering from faults, i.e. make it fault tolerant. In thi...
متن کاملCharacterization of catastrophic faults in two-dimensional reconfigurable systolic arrays with unidirectional links
The catastrophic fault pattern is a pattern of faults occurring at strategic locations that may render a system unusable regardless of its component redundancy and of its reconfiguration capabilities. In this paper, we extend the characterization of catastrophic fault patterns known for linear arrays to two-dimensional VLSI arrays in which all links are unidirectional. We determine the minimum ...
متن کاملDisjoint Covers in Replicated Heterogeneous Arrays
Reconfigurable chips are fabricated with redundant elements that can be used to replace the faulty elements. The fault cover problem consists of finding an assignment of redundant elements to the faulty elements such that all of the faults are repaired. In reconfigurable chips that consist of arrays of elements, redundant elements are configured as spare rows and spare columns. This paper consi...
متن کاملFormal Probabilistic Analysis of Stuck-at Faults in Reconfigurable Memory Arrays
Reconfigurable memory arrays with spare rows and columns are quite frequently used as reliable data storage components in present age System-on-Chips (SoCs). The spare memory rows and columns can be utilized to automatically replace rows or columns that are found to contain a cell fault after fabrication. One of the biggest SoC design challenges is to estimate, prior to the actual fabrication p...
متن کامل